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Abstract —We consider the locality of encoding and decoding 
operations in distributed storage systems (DSSh and propose 
a new class of codes, called locally encodable and decodable 
codes (LEDC), that provides a higher degree of operational 
locality compared to currently known codes. For a given locality 
structure, we derive an upper bound on the global distance and 
demonstrate the existence of an optimal LEDC for sufficiently 
large field size. In addition, we also construct two families of 
optimal LEDC for fields with size linear in code length. 

I. Introduction 

Motivated by practical applications in data storage and 
communication, locality has become an increasingly important 
notion in the study of erasure codes. A code with some 
locality property allows certain operations to be performed 
by accessing only a portion of a codeword or a data vec¬ 
tor. As a result, signihcant amount of resources (bandwidth, 
computational power, memory) can be saved. We list below 
some representative families of codes with certain locality 
properties. 

• Locally decodable codes were proposed by Katz and 
Trevisan m, which allow a single symbol of the original 
data to be decoded with high probability by only querying 
a small number of coded symbols of a possibly corrupted 
codeword. 

• Locally testable code was brst systematically studied by 
Goldreich and Sudan Q. For such a code, there exists 
a test that checks whether a given string is a codeword, 
or rather far from the code, by reading only a constant 
number of symbols of the string. 

• Locally repairable codes (LRC) (see e.g. El, El) were 
tailored-made for distributed storage systems (DSS), 
which guarantee that any erased coded symbol (corre¬ 
sponding to a failed storage node) can be reconstructed 
from a small subset of other coded symbols (correspond¬ 
ing to other surviving nodes). LRC with a given locality 
network topology were studied in El, IS- 

• Update efficient codes, also known as locally updatable 
codes (see e.g. ||7l, lH, 13), require updating as few nodes 
as possible whenever one data symbol is changed. 

• Coding with constraints iflOl . ifTTll . ifT^ . lfT3l . lfT4l . ifTSl 
requires that each coded symbol must be a function of a 
given subset of data symbols. In a classical code, each 
coded symbol can be a function of all data symbols. 

This work was completed when Han Mao Kiah visited Singapore University 
of Technology and Design. 
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Fig. 1: An example of an LEDC with two local subcodes. Here 
/Cl = {xi,a:2,a:3}, IC2 = {2:2,..., 0:5}, Afi = {ci,...,C 4 }, 
and A 2 = {c 5 ,..., cio}. The underlying held is Fy - the 
set of integers modulo 7. Coded symbols in each group A/i 
are encoded from the corresponding data symbols in Ki and 
conversely can be used to decode these data symbols. Each 
local subcode is an MDS code and moreover, the LEDC, 
which encodes k — |/Ci U /C 2 I = 5 data symbols into 
n = |A/i| + IA/ 2 I = 10 coded symbols, can reach the optimal 
minimum distance d = 4. In this example, the subcodes 
are encoded by simply using two Vandermonde matrices. In 
general, this naive construction yields LEDCs with optimal 
distances only in certain cases (see Theoremj^and Examplej^. 

Continuing along this line of research, we propose the 
class of locally encodable and decodable code (LEDC) that 
provides a higher level of operational locality in distributed 
storage systems (DSS) compared to currently known codes. 
In an LEDC, the set of n coded symbols (corresponding to 
n storage nodes) is partitioned into m disjoint subsets A/i’s, 
each of which is responsible for encoding and decoding of 
a given subset ICi of some data symbols. Each pair (A/i,/Ci) 
forms a maximum distance separable (MDS) code, referred to 
as a local subcode. The parameter of interest is the minimum 
(Hamming) distance of the global LEDC. An LEDC that has 
the largest minimum distance is called optimal. 
















We illustrate in Fig. [T] a toy example of an optimal LEDC 
with minimum distance d = 4. This LEDC consists of m = 2 
local subcodes. The first local subcode, which is a [4, 3] MDS 
code, encodes three data symbols K.i = {xi, 0 : 2 , 2 : 3 } into four 
coded symbols Ni = {ci, ..., C4}. The second local subcode, 
which is a [ 6 ,4] MDS code, encodes four data symbols IC 2 = 
{x 2 , ■ ■ ■, X5} into six coded symbols A /2 = {cs, ..., cio}. The 
underlying field is F 7 - the set of integers modulo 7. 

An LEDC distinguishes itself from the family of locally 
repairable codes in the following aspects. 

• Eirst, both encoding and decoding can be done locally 
for each local subcode of an LEDC - to create coded 
symbols in Mi, only data symbols from ICi are involved, 
and conversely, to decode data symbols in ICi, only coded 
symbols in Mi are involved. In contrast, the encoding 
operation in each local group in an LRC may involve all 
data symbols and only local repair is required, not local 
decodability. In other words, an LRC does not provide 
local encoding and decoding and only guarantees local 
repair, which can also be done by an LEDC. 

• Secondly, in the context of LEDC, the structure of the 
local groups (i.e. ICi’s and A/i’s) are given, whereas in 
the context of LRC, the size and the repair capability of 
each group are given as input. 

LEDC falls under the regime of coding with constraints as 
each coded symbol must be a function of a given set of data 
symbols. However, while LEDC provides local decodability, 
coding with constraints does not. Notice also that the notion of 
local decodability in the setting of LEDC is different from that 
in the setting of locally decodable codes (LDC) ||T]. Indeed, 
while an LEDC guarantees that each given subset of data 
symbols can be decoded (with probability one) from a specific 
subset of coded symbol (possibly under some errors/erasures), 
an LDC requires that each data symbol can be decoded 
(with high probability) from a small subset of coded symbols 
(possibly under some errors). 

A crucial feature of an optimal LEDC is that the repair ca¬ 
pability of the global system can be greater than its individual 
local systems. To illustrate this, we consider the example in 
Fig. 0 and regard the local subcodes as codes operating over 
independent storage systems. The first storage system utilizes 
an MDS code of minimum distance di = 2 and hence tolerates 
one node failure. On the other hand, the second storage system 
utilizes an MDS code of minimum distance 0(2 = 3 and hence 
tolerates two node failures. If the two codes are co-designed to 
form an optimal LEDC with optimal distance d = 4, then the 
LEDC provides extra protection for node failures: any three 
node failures across the two systems can be tolerated. 

The improvement in fault tolerance results from the fact that 
the two systems share some common data symbols {X 2 and 
X 3 in this toy example). Each system also has some private 
data symbols (xi in the first system and X4 and X5 in the 
second system). In normal condition where the node failures 
are within the local fault tolerance, each storage system can 
work independently to repair the failed nodes. No sharing of 
private data is required. However, in a catastrophic scenario 


where the number of node failures exceeds the local fault 
tolerance, the two systems can cooperate to repair the failed 
nodes by sharing the private data. 

Lurthermore, there exist LEDCs whose fault tolerances 
exceed the sum of the fault tolerances of their local subcodes. 

Example 1. Eor example, we can construct an LEDC with 
the following locality structure: 

Ml = {C1,C2,...,C5}, /Cl = {X1,X2,X3,X4}, 

J^2 = {Ce, C 7 , . . . , C 12 }, IC 2 = {X 2 ,X 3 ,..., X 7 }. 

Here, the local subcodes are [5,4] and [7,6] MDS codes 
that each tolerate up to one erasure. We demonstrate later in 
Example]^ that the LEDC can achieve a minimum distance of 
five and hence is able to tolerate up to four erasures. 

Our key results are summarized below. 

• We prove that for any given locality stmcture (i.e. IC/s 
and Mi’s), there always exists an optimal LEDC over 
any sufficiently large finite fields. The optimal minimum 
distance, however, can be determined in polynomial time. 

• When m = 2 (i.e. there are only two local subcodes), we 
provide constmctions for two families of optimal LEDC. 

(I) A straightforward construction using nested MDS 
codes for the case where |/Ci H /C 2 I is small. 

(II) An algebraic construction of LEDC as a (punctured) 
subcode of a cyclic code of length q—1, where q is the 
size of the finite field, in the case where |A/i — /Ci| = 
IA 2 — /C 2 I. 

Our paper is organized as follows. Necessary definitions and 
notation are provided in Section |I^ We prove the existence of 
optimal LEDC over sufficiently large finite fields in Section [III| 
Section is devoted for the construction of optimal LEDC 
over small fields when there are two local subcodes. 

H. Preliminaries 

Let Fq denote the finite field with q elements. Let [n] denote 
the set {1,2,..., n}. Eor a k x n matrix M, for i G [/c] and 
j G [n], let Mi and M[j] denote the row i and the column 
j of M, respectively. We define below standard notions from 
coding theory (for instance, see 061 ). 

The support of a vector u = (mi,...,u„) G F” is the 
set supp(m) = {i G [n] : Ui 7^ 0}. The (Hamming) weight 
of a vector m S F” is |supp(M)|. The (Hamming) distance 
between two vectors u and v of F^ is defined to be d(M, v) = 
[{/ G [n] : Ui ^ Ui}|. A fc-dimensional subspace ^ of F^ is 
called a linear [n, k,d]q (erasure) code over F^ if the minimum 
distance d{^) between any pair of distinct vectors in is 
equal to d. Sometimes we may use the notation [n, k, d] or 
just [n, k] for the sake of simplicity. The vectors in are 
called codewords. It is known that the minimum weight of a 
nonzero codeword in a linear code is equal to its minimum 
distance d('^). The well-known Singleton bound ( ifThl Ch. 1]) 
states that for any [n, fc, d], code, it holds that d < n — k + 1. If 
the equality is attained, the code is called maximum distance 
separable (MDS). 


A generator matrix G of an [n, k]q code is a A: x n matrix 
whose rows are linearly independent codewords of Then 
^ = {xG: X e Fq}. It is also well known that if d(^) = d 
then can correct any d — 1 erasures. In other words, x can 
be recovered from any n — d+1 coordinates of the codeword 
c = xG. An [n, k, d] code can be used in a DSS as follows. 
A vector x of k data symbols can be encoded into n coded 
symbols c = xG, each of which is stored at one node in the 
system. Then x can be recovered from any set of n — d + 1 
nodes. Hence the DSS can tolerate d — 1 node failures. 

Definition 1. Let n > k > 1 and m > 1 be some integers. Let 
Ki ,..., Km be m nonempty (possibly overlapping) subsets of 
[fc] such that [fc] = U™ Let A^i,..., Nm be m nonempty 
non-overlapping subsets of [n] that partition [n], i.e. NiHNii = 
0 if i ^ i' and [n] = U'^iNi. Suppose that rii = \Ni\ > 
\Ki\ = ki for every i G [m]. An {Ni,Ki}'^^-LKDC over an 
alphabet E is a mapping € : —> E" that maps a vector 

of k data symbols x = (xi,... ,Xk) € E^ into a vector of 
n = coded symbols c = (ci,...,c„) G E” and 

satisfies the following properties. 

(PI) The coded symbols in A/) — {cj : j G Ni} only depends 
on the data symbols in JCi = {xj : j G Ki} (Vi G [?ri]). 
(P2) The set of data symbols JCi can be determined from any 
subset of ki coded symbols of the set Afi (Vi G [m]). 

When E is a finite field for some prime power q and the 
mapping £ is linear, the corresponding LEDC is called linear. 
For linear LEDC, the mapping £ can be represented by a A: x n 
matrix G over F^ such that £(a;) = xG. Such a matrix G is 
referred to as a generator matrix of the LEDC. In this work 
we are only interested in linear LEDC. The second property 
(P2) in Definition [T] which must satisfy, states that each pair 
of Hi coded symbols in Afi and ki data symbols in ICi forms 
a linear [ni,ki] MDS code. We refer to these m MDS codes 
as the local subcodes of the LEDC. An example of a linear 
LEDC with two local subcodes over Fy is given in Fig. [T] 

III. Existence of Optimal Locally Encodable and 
Decodable Codes Over Large Fields 

We first discuss the closely related concept of coding with 
constraints. The upper bound on the minimum distance for 
a code with coding constraints jm, ina still applies in the 
setting of LEDC. The existence proof for optimal codes over 
large finite fields Ha Lemma 12], however, needs to be 
appropriately modified to take into account the new locality 
feature of LEDC. 

A. Coding With Constraints 

In the setting of linear coding with constraints m, im, 
ED, 113, 03, lEl, the data vector a; G F* is encoded into 
the coded vector c = xG G F” for some k x n matrix G in 
Fq, subjected to the following constraints: each coded symbol 
Cj is a function of a given subset of the data symbols indexed 
by Cj C [A:]. In a classical code, Cj = [A;] for all j G [n]. 
For i G [A:] let Ki = {j : i G Cj}. Then it is obvious that 
the support of the ith row of any valid generator matrix G 


of a code with coding constraints must be included in Ki. 
Similarly, the support of the jth column of G must be included 
in Cj . The following theorem presents an upper bound on the 
minimum distance of a code with coding constraints and states 
that an optimal code attaining this upper bound does exist over 
a sufficiently large finite field. 

Theorem 1. f/ ITTI Corollary 1 ], m Lemma 12]) Suppose that 
'if is a linear code that encodes the vector of data symbols 
a; G F^ into the vector of coded symbols c G F^ under the 
following constraints: each Cj is a function of a given subset of 
the data symbols indexed by Cj C [A;]. Let Ki = {j \ i G Cj}. 
Then 

d('^) < dmax = 1 + min {| Uie/- |/|}. (1) 

0^/C[fc] 

Moreover, when q is sufficiently large, there exists a code with 
minimum distance attaining this bound. 

Proposition 1. The upper bound dmax in Theorem can be 
found in polynomial time. 

Proof: Note that dmax is the largest d satisfying 

>d-l + |J|, f0fic[k]. (2) 

From the Singleton bound, d< n — k + f. Hence, we can find 
dmax by verifying ® for each d ranging from n — A: +1 down 
to 1. As long as can be verified in polynomial time for 
every d G [n — A; + 1], we can find dmax in polynomial time. 

Note that any d G [n —A: + l] can be written as d = (n—A: + 
1) — (5, for some 0 < S < n — k. Hence, (j^ can be rewritten 
as 

|U,e/i?,| >n-A:-(5+|/|, V0^JC[A:]. (3) 

In the proof of 041 Lemma 10], we provide a polynomial 
time algorithm to verify the so-called MDS Condition 

I Uig/ Ki\ fL tt — k \I\, V0 f I G. [A:], (4) 

where Ki,..., Kk are arbitrary nonempty subsets of [n]. We 
do so by creating a network with one source and k sinks and 
prove that 0 holds if and only if the capacity of a minimum 
cut between the source and any sink is at least n. 

Using exactly the same proof, we can show that ([^ holds if 
and only if the capacity of a minimum cut between the source 
and any sink of that network is at least n — S. As the capcity 
of such a minimum cut can be computed in polynomial time, 
0 can be verified in polynomial time and so can 0- ■ 

B. Optimal Locally Encodable and Decodable Codes Over 
Large Fields 

In this subsection we establish that an optimal LEDC always 
exists over a sufficiently large field. 

Theorem 2. Suppose that f is a linear {Ni, Ki}Tf.^-LEDC. 
Let rii = |Wi|, n = n-i, and k = \ CqLi Ki\. For i G [m] 

and j G Ni let Cj = Ki. For each i G [A:] let Ki = {j : i G 
Cj}. Then 

d('^) < dmax = 1 + min {|Uig/i?i|-|/|}. (5) 


Moreover, when q is sufficiently large, there exists a linear 
{-/Vi, Ki}^^-LEDC over with minimum distance attaining 
this bound. 

The upper bound simply follows from Theorem [T] Note 
that by Proposition this upper bound can be determined in 
polynomial time. We present below a proof of the existence of 
optimal LEDCs over large fields. We aim to show that when 
q is sufficiently large, there always exists a k x n matrix G 
over Fq that generates an {Ni, Ki}^^-LEDC with minimum 
distance attaining 0. 

Firstly, observe that if G is a generator matrix of an 
then by (PI), supp(G*) C R, and 
supp(G[j]) C Cj for all i G [fc] and j G [n]. Let G^ = 
(5ij)fexn where 

^i,j 1 j G Ri, 

0, otherwise, 

where are indeterminates. 

For any subset J C [n] of size n — dmax + 1 let — 
{y,S‘) be the bipartite graph defined as follows. The vertex 
set 'f' can be partitioned into two parts, namely, the left part 
L = jfi,... and the right part R = {rj : j G J}. The 
edge set is 

= {(^i, tj) ■ i G [k], j G J r\ Ri^. 

Then for every 0 ^ I C [k], the neighbor set of * G /} 
has size at least 

\Ni{i,:iGl})\ = \U,ei (Jni?,)| 

= |Jn(U,eji?,)| 

= \{u,^jR,)\{[n]\j)\ 

> |( Uie/ Ri) \ - (n - I J|) 

^ (^max “f 1) {tT. dniax “t” 1)) 

= |/|- 

Hence, according to the famous Hall’s marriage theorem, there 
is a matching of size k in In other words, there exist k 
distinct indices ji,... ,jk in J such that ji G Ri for all i G [k]. 

For each subset J C [n] of size n—dmax+l, we consider the 
submatrix P'^ of G^ that consists of the columns indexed by 
ji,..., jfe as discussed above. Then the determinant det(P'^), 
which is a multivariable polynomial in Fq[..., j,...], is 
not identically zero. The reason is that in the expression of 
det(P'^) as a sum of monomials, there is one monomial that 
cannot be canceled out, namely nt -1 

5^"'= n det(P^)GFJ...,e..„...]. 

JCH 

I I — ^ ~ <^max + l 

Then is not identically zero. Roughly speaking, the 

polynomial captures the locality structure {Ni, 
of the LEDC and the desired minimum distance dmax- 

Regarding the locality for decoding, i.e. every ki coded 
symbols in Mi can be used to recover all data symbols in 


K-i, let 

m 

^MDS^nndet(g,)GF,[...,e, 

Qi 

where the second product is taken over all ki x ki submatrices 
Qj that consist of ki rows of G^ indexed by Ki and some 
ki columns of G^ among those indexed by Ni. By definition 
of G^, each of such matrices has a nontrivial determinant. 
Therefore, is not identically zero. 

Thus, 

^ X ^ 0. 

Therefore, by ifTTl Lemma 4], for sufficiently large q, there 
exist gij G F^ such that . ,gij,...,) ^ 0. Let G = 
(5*j) (fo'" where j ^ Ri, simply let = 0). Since 
^ 0, the linear code generated by G has minimum 
distance at least dmax- Moreover, since 7 ^ 0, the 

coded symbols in each Mi form an [rii, ki] MDS code. Hence, 
G generates an optimal LEDC. We complete the proof of 
Theorem |2l 

IV. Optimal Locally Encodable and Decodable 
Codes With Two Local Subcodes 

In this section we restrict ourselves to the case m = 2, i.e. 
when the LEDC has exactly two local subcodes. We provide 
two constructions of optimal LEDCs over fields of sizes linear 
in n. 

Without loss of generality, we consider the following local¬ 
ity structure: 

Ki = {Ij 2 , ..., fci}, K 2 = {ki — t + 1 , ki — t + 2 ,..., k}, 
Ni = ( 1 , 2 ,... ,ni}, N 2 = {ni + l,ni + 2,... ,n}. 

Here t = \Ki n K 2 \ is the number of common data sym¬ 
bols shared by the two local subcodes. For brevity pur¬ 
pose, we denote an LEDC with this locality structure as an 
[ni, fci; 712, A: 2 ; f]-LEDC. 

When t = k, i.e. Ki = K 2 = [k], there is no locality in 
encoding, as both subcodes use all of the data symbols in their 
encoding process. An optimal LEDC is simply an [n, k] MDS 
code with minimum distance d = n — k + 1. In the remainder 
of this section, we always assume that t < k. The following 
is a straightforward corollary of Theorem 

Corollary 1 . Suppose that t = \KiC] -A2I < k. Additionally, 
assume that either t < min{A:i,fc 2 } or rii — ki = 712 — ^ 2 - 
If there exists a linear [ni,ki;n2,k2',t]-LEDC with minimum 
distance d then 

d <1 + t -\- minjTii — fci, 712 — ^2}- ( 6 ) 

The upper bound (|^ reflects the fact that the more common 
data the local subcodes have, the larger the global minimum 
distance of the LEDC. In one extreme case when f = 0, i.e. 
the two subcodes share no common data symbols, the global 
minimum distance is equal to one local minimum distance, 
whichever smaller. Hence, the global LEDC offers no further 
protection against erasures to each local subcode. When t 


is sufficiently large, however, the global LEDC can offer 
considerable amount of additional protection against erasures. 

We henceforth assume, without loss of generality, that ni — 
ki <712 — ^ 2 . Then (|^ can be rewritten as 

d<l+t + ni — ki. (7) 


We aim to construct optimal LEDCs whose minimum dis¬ 
tances meet the upper bound (|7]i. 

In what follows, we consider a linear [ni,ki;n 2 ,k 2 ',t]- 
LEDC and describe via its generator matrix G. Erom 
the local encoding property, we may write G as 



77 i 

n 2 



/ 

u 

0 N 

. ki-t 


A 

B 

1 ^ 

( 8 ) 

\ 

0 

V / 

k 2 -t 



where U, A, B, and V are {ki — t) x rii, t x rii, t x 712 , 
and (^2 — f) X 77,2 matrices, respectively. Erom the properties 
of an LEDC, the linear code ^ with generator matrix = 

^ is an [ni, fci] MDS code and the linear code ^ with 


generator matrix G^ = 


B 

"V 


[ 772 ,^ 2 ] MDS code. 


A. Optimal LEDCs from Nested MDS codes 

Our first construction uses pairs of nested MDS codes. More 
formally, for 1 < fc' < fc < 77, the pair is a pair of 

nested {k',k;n) MDS codes if C ^ and ‘rf' and ‘rf are 
[? 7 , k'] and [77, k] MDS codes, respectively. 


Theorem 3 (Construction I). Suppose 772 — fc 2 + 1 > f and 
there exist pairs of nested [ki — t,ki; ni) and {k 2 — t, k 2 ] 772 ) 
MDS codes. Then there exists an optimal [ 771 , fci; 712 , A: 2 ; f]- 
LEDC ^ whose minimum distance is rii — ki + t 1. 


Proof: Let , s/) be the pair of nested (fci —t,ki\ni) 
MDS codes with generator matrices U and ^ 


let 


. Similarly, 


be the pair of nested (k 2 — t,k 2 ',n 2 ) MDS codes 


with generator matrices V and 


B 

V" 


Then we consider a typical nonzero codeword c = 
{xi,X 2 ,X 3 )G, where Xi = (xt : i € Ki \ K 2 ), X 2 = 
(xi : i & KxC K 2 ) and X 3 = {xi : i G K 2 \ Ki). Hence, 
c = {xiU + X 2 A,X 2 B + X 3 V). We have the following cases. 

(i) When X 2 = 0 and £Ci f 0, then the first tti coordinates 
tCiG of c is a nonzero codeword from and has weight 
at least ni — ki + t + 1 . 

(ii) When X 2 = 0 and X 3 f 0, then the last 772 coordinates 
X 2 V of c is a nonzero codeword from and has weight 
at least 772 — ^2 + f + 1 > tii — fci -f f -f 1 . 

(iii) When X 2 f 0, then c consists of nonzero codewords 
from si and SS. Hence, c has weight at least (771 — ki + 
1) + (772 — A :2 +1) > 77i — fci 1, because we assume 


n2 — k2 + I > t- 


Therefore, the minimum distance of is 771 — /ci + f + 1, 
achieving the upper bound in Q. ■ 


Pairs of nested MDS codes were studied by Ezerman et 
al. E] in the context of quantum codes. Specifically, pairs 
of nested MDS codes of all possible parameters (assuming 
the MDS conjecture) were constructed and in general, a 
pair of {k',k;n) MDS codes exists if 77 < g. Therefore, 
max{ 77 i, 772 } < q suffices for Construction 1. The LEDC 
in Eig [T] is, in fact, constructed using two pairs of nested 
codes over F 7 , generated by Vandemonde matrices. Note that 
a Cauchy matrix also generates a pair of nested MDS codes, 
while requires a larger field size than a Vandermonde matrix. 

When the assumption in Theorem does not hold, i.e. t > 
1 + max{ 77 i — A:i, 772 — ^ 2 }, Construction I may fail to produce 
an optimal LEDC. We demonstrate this fact below. 


Example 2. Let Ki = {1,2,3,4}, K 2 = (2,3,4,5}, and 
ni = 772 = 5. In this case, t = \Kir)K 2 \ = 3 > l+max{ni — 
fci, 772 — A: 2 }. Hence, the assumption of Theorem]^ is violated. 
In fact, using two Vandermonde matrices over F 7 for the local 
subcodes results in the following generator matrix: 


/I 1 1 1 1 0 0 0 0 0\ 


G = 


12 3 4 
14 2 2 


11111 
1 2 3 4 5 
4 2 2 3 


116 16 1 
yo 0 0 0 0 1 1 6 1 6 / 

This matrix generates an LEDC of distance d = 4, which does 
not reach the upper bound Q. Hence, it is not optimal. 


B. Algebraic Construction of Optimal LEDCs 

We aim to construct optimal [tii, fci; 772 , ^ 2 ; t]-LEDCs 
whenever 777 -f 772 < g — 1 and r = ni — fci = 712 — ^2- Such 
LEDCs have minimum distance achieving the bound Q, i.e. 
d = r + t P 1. The generator matrices G of such optimal 
LEDCs have the block form given in (|^. In what follows, we 
find appropriate field elements {uj,Vj : 0 < j < r + t — 1}, 
: I < £ <t,0 < j < r + t - £}, {bf^ : I < £ <t,0 < 
j < r + £ — 1} and construct four component matrices of G, 
namely U, V, A, and B in the following forms. 


U = 


A = 


B = 


V = 


/ Mo 

Ml 



Ur+t 

0 


0 

Uq 

Ml 


Ur+t — 1 

Vr-\-t ' ' ' 

0 

• 

V 0 

0 


Mo 


■ ■ ■ —1 


/ 0 

0 

•• 0 

Uq 



“r + t-1 \ 

0 

0 

0 

0 



«r+t-2 

V 0 

0 

•• 0 

0 

0 

• • • • • • 

a'r'’ / 

/ 0 

0 

0 

0 


0 • • • 

\ 

0 

0 

0 

0 


^(2) . . . 

Oq 

^(2) 

^r+1 

V 0 

0 

•• 0 

bit) 



^r+t-1 / 

( ° 

0 


Vo 


■ • ■ Vr-\-t — l 

Vr+t \ 

0 

Wo 

Wl 


Vr-\-t — l 

Vr-\-t ' ■ ' 

0 

\ Wo 

Wi 



Vr-\-t 

0 

0 / 
























In this subsection, we demonstrate the existence of held ele- 
ments u/s, u, ’s, a\ , and h\ such that the above construction 
yields a generator matrix G of an optimal LEDC. In particular, 
we prove the following theorem. 

Theorem 4 (Construction II). Suppose ni — ki = n 2 — 
k 2 = r and ni + n 2 < q — 1. Then there exists an 
optimal [ni,ki;n 2 ,k 2 ',t]-LEDC whose minimum distance 
is r + t + 

To derive the distance properties, our strategy is to show that 
the linear codes , 2 /, iTS, and are subcodes of certain cyclic 
codes ifThl Chap. 7]. To this end, suppose that <? — 1 > ni + 
712 and identify a vector (cq, Ci,..., Cq_ 2 ) € with the 

polynomial CjX^ in the quotient ring Fq[a ;]/— 1). 

Note, however, that while the cyclic codes that we consider 
are of length q—1 over Fg, our codes jz/, SS, and may have 
shorter lengths, namely rii, 712 , and n, respectively. However, 
we can regard , SS, and 'io as codes of length g — 1 by 
simply adding a sufficient number of zero coordinates to the 
right of each codeword. Doing so obviously does not affect 
the polynomial representation of each codeword. Thus, from 
now on we can treat , SS, and ^ as subspaces of F®“^. 

Under the mapping of vectors to polynomials, let 
(uq, Ml,..., Ur+t) and {vq, Vi, , Vr+t) Correspond to u{x) 
and v{x), which are polynomials of degree at most r + t, 
respectively. For 1 < £ < f, let (^aQ\ af\ ..., 

and ..., correspond to a^^\x) and b^^\x), 

which are polynomials of degrees at most r+t—£ and r+f—1, 
respectively. 

Furthermore, the codewords in the linear codes and 

in their polynomial representations, can be obtained via 
linear combinations of certain polynomials described by u{x), 
v{x), a^^\x), and b^^\x). Specihcally, 

1) is the vector subspace of F|“^ spanned by the poly¬ 
nomials x'^u{x) for 0 < i < ki — t — 1, x^v{x) 
for 771 < j < 77i -f fc 2 — f — 1, and c^^^x) = 

-f a;”i+'==-^6(^)(x) for 1 < f < f, 

2) jz/ is the vector subspace of F^“^ spanned by the polyno¬ 
mials x'‘u{x) for 0 <i< ki — t—1 and 

for 1 < f < f, 

3) is the vector subspace of F|“^ spanned by the poly¬ 
nomials x^v{x) for 771 < j < ni + k 2 — t — 1 and 
a;"i+'= 2 -^ 6 (U(a;) for 1 < f < f. 

Next, we provide sufficient conditions for the polynomials 
u{x), v{x), a^^'i{x), and b‘^^\x) so that the codes sz^, and 
^ have the desired dimension and distance properties. 

Proposition 2. Let oj be a primitive element in Fg. Suppose 
that the polynomials u{x), v(x), a^^^x), and b'^^\x), 1 < < 

t, satisfy the following conditions. 

(i) (i) 

(Dl) uq, Vq, ttg ^ and 6 g are nonzero for 1 < (. <t. 

(D2) u{uj^) = v(uj^) = 0 for 0<j<r + t— 1. 

(D3) For 1 < £ <t, = 0 for 0 < j < 

r — 1. 


(D4) For 1 < i < t, = 0 for 0 < j < r + t — 1 , 

where c^^\x) = + x'^^+'^^-^b^^\x). 

Then the LEDC defined as above is an optimal 
[771, fci; 772, A:2; t]-LEDC of minimum distance r -f f -f 1 . 

To prove this proposition, we employ the well-known BCH 
bound on minimum distance of a cyclic code. We hrst recall 
some necessary notations from ifThl Ch. 7]. Let C be a cyclic 
code of length g — 1 over Fg. An element z G Fg is called a 
zero of C if c(z) = 0 for every codeword c{x) G C. Let Z be 
the set of all zeros of C. The polynomial g{x) = 
is called the generator polynomial of C. Then c{x) G C if and 
only if g(a;)|c(a;). 

Theorem 5 (BCH bound). Let oj be the primitive element 
o/Fg and r be an integer. The cyclic code with the generator 
polynomial {x—l){x—uj) ■ ■ ■ (x—uj^~^) has minimum distance 
at least r -f 1. 


Proof of Proposition^ First observe that from condition 
(Dl), the matrices G given in ([^, G^i/ = ^ ^ ^, and Ggg = 

B \ 

all have full rank. Therefore, the corresponding linear 


codes szZ, and fS have the desired dimensions. For the 
distance properties, we make the following two claims. Recall 
that we may regard ^, szZ, and as codes of length g — 1 by 
adding a sufficient number of zero coordinates to the right of 
each codeword. 


(Cl) ^ is a subcode of the cyclic code with generator poly¬ 
nomial gi{x) = {x — l)(a: — uj) - ■ ■ {x — 

(C2) and PS are subcodes of the cyclic code with generator 
polynomial g2{x) = (x — l)(a; — uj) ■ ■ ■ (x — 

Then by the BCH bound, the codes and have 

minimum distance r + t + l, r -\-l and r -f 1, respectively and 
the proposition is immediate. 

Hence, it suffices to prove the claim. From conditions (D2) 
and (D4), we deduce that 


u{uj^) = v{uj^) = = • • • = 0 < j < r+t—1. 


Therefore, gi{x) = {x — l){x — oj) ■ ■ ■ {x — 0 ;’'+*“^) divides 
all polynomials in the basis of Hence, (Cl) follows. 

It can be also verihed that g 2 {x) = {x — l){x — oj) ■ ■ ■ {x — 
oj'^~^) divides all polynomials in the bases of xP and 
respectively. Hence, (C2) follows. ■ 

The following proposition shows that the polynomials sat¬ 
isfying conditions (D1)-(D4) in Proposition]^ do exist. Hence, 
Theorem |4| follows. 

Proposition 3. There exist polynomials u{x), v{x), a^^\x), 
and b^^\x) for 1 < £ <t that satisfy conditions (D1)-(D4). 

To prove this proposition, we consider the following lemma. 

Lemma 1. Fix l<£<t, r>l and t — £ < T < q — £. 
Then there exists polynomials a(x) with degree at most t — £ 
and b{x) with degree at most £—1, having nonzero constants. 




such that 

a{uj^) + x^b{uj^) = 0 for r<j<r + t— 1. (9) 

Proof: Let a{x) = 

Then (|^ is equivalent to the linear system of equations 
M{ao, ai,..., Ur-e, bo, bi,..., = 0 , where M is the 

f X (f + 1) matrix: 


/1 ... 


... 


\ 

1 ■ ■ • 

(^r + l)t-C 

... 

(^,+1)T+£-1 


V 1 


(a;’'+‘-^)^ ... 


/ 

Then M 

can be rewritten as 



( 1 ... 


{uj-^Y ... 

(^T+^-l)r 

\ 

1 ... 


(a;^)’'+i . . . 

^^T+e-ly+l 


V 1 ... 


{uFY^"~^ ■ ■ ■ 

^^T+e-ly+t-1 

/ 

Since t - 

- f < T and 

T + £-l < q 

— 1 , the values 

1 


uP", ..., are distinct nonzero elements 

in Fq. Therefore AT is the generator matrix of a [f + 
l,f,2] MDS code. Hence, (oq, oi,..., at_^, 6 o, &i,..., 6 ^_i) 
belongs to its dual code. Since the dual code is a [f + 
1, l,f + 1] MDS code, which has minimum distance f + 1, 
we can choose {aQ,ai,... ,at-e,bo,bi,... ,be-i) such that 
oo, ai,..., at-e, bo,bi,..., are nonzero. In particular, oq 
and bo are nonzero. Thus, a{x) and b{x) are the desired 
polynomials. ■ 

Proof of Proposition^ Clearly, setting u{x) = v{x) = 
{x — l)(x — oj) ■ ■ ■ {x — w satisfies conditions (Dl) and 

(D2). 

For 1 < £ < f, let ai^\x) and bi^\x) be the polynomials 
obtained from Lemma [T] by setting T = {ni + k 2 — i) — 
{ki — t + i — 1). (It is obvious to verify that such T satisfies 
t — £ < T < q — i for Sill I < £ < t.) Then we set 

= (x — l)(x — uj) ■ ■ ■ (x — uj’'~^)ai^\x), 

b^^\x) = {x — l)(x — Lu) ■ ■ ■ {x — uj^~^)bi^\x). 

We can then verify that conditions (Dl), (D3) and (D4) are 
satisfied and this completes the proof. ■ 

Remark 1. For 1 < £ < t, observe that the polynomials 
ai^\x) and bi^\x) are found by solving t linear equations in 
t + 1 variables in Lemma [T] Therefore, the codes ‘rf, sP and 
SS can be constructed in time polynomial in q = 0(ni + nf) 
and t. 

Example 3 (Example [T] continued). We provide the optimal 
LEDC in Example [T] using Construction IT In this case, Ki — 
{1,2, 3,4}, K 2 = 12,3,4,5,6,7}. Hence, ki = 4, ^2 = 6 , 
k = 7, and f = 3. Moreover, ni = 5, n 2 = 7, and hence, 
r = ni — ki = n 2 — k 2 = f- Let g = 13 and w = 2 be the 
primitive element of F 13 . Since r + f — 1 = 3, we set 

u{x) = v{x) = (x — l)(x — 2 )(x — 2 ^)(x — 2 ^) 

= 12 + lOx + 5x^ + llx^ + x^. 


Repeated applications of Lemma [T] yield the polynomials 

= 12 + 2a; + 5a;^ + 7®^, = 8 + 5®, 

a*'^^(®) = 12 + 7® + 7®^, b^^\x) = 7 + 2® + 4®^, 

= 12 + X, b^^\x) = 10 + 12® + 3 ®^ + ®®. 


Hence, the optimal [5,4; 7, 6 ; 3]-LEDC is given by the gen¬ 
erator matrix 


/ 12 

10 

5 

11 

1 

0 

0 

0 

0 

0 

0 


0 

12 

2 

5 

7 

0 

0 

0 

0 

0 

8 

5 

0 

0 

12 

7 

7 

0 

0 

0 

0 

7 

2 

4 

0 

0 

0 

12 

1 

0 

0 

0 

10 

12 

3 

1 

0 

0 

0 

0 

0 

0 

0 

12 

10 

5 

11 

1 

0 

0 

0 

0 

0 

0 

12 

10 

5 

11 

1 

0 

V 0 

0 

0 

0 

0 

12 

10 

5 

11 

1 

0 

0 / 
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